Random forests versus Neural Networks - What's best for camera localization?

نویسندگان

  • Daniela Massiceti
  • Alexander Krull
  • Eric Brachmann
  • Carsten Rother
  • Philip H. S. Torr
چکیده

This work addresses the task of camera localization in a known 3D scene given a single input RGB image. State-of-the-art approaches accomplish this in two steps: firstly, regressing for every pixel in the image its 3D scene coordinate and subsequently, using these coordinates to estimate the final 6D camera pose via RANSAC. To solve the first step, Random Forests (RFs) are typically used. On the other hand, Neural Networks (NNs) reign in many dense regression tasks, but are not test-time efficient. We ask the question: which of the two is best for camera localization? To address this, we make two method contributions: (1) a test-time efficient NN architecture which we term a ForestNet that is derived and initialized from a RF, and (2) a new fully-differentiable robust averaging technique for regression ensembles which can be trained endto-end with a NN. Our experimental findings show that for scene coordinate regression, traditional NN architectures are superior to test-time efficient RFs and ForestNets, however, this does not translate to final 6D camera pose accuracy where RFs and ForestNets perform slightly better. To summarize, our best method, a ForestNet with a robust average, which has an equivalent fast and lightweight RF, improves over the stateof-the-art for camera localization on the 7-Scenes dataset [1]. While this work focuses on scene coordinate regression for camera localization, our innovations may also be applied to other continuous regression tasks.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Random Forests versus Neural Networks - What's Best for Camera Relocalization?

This work addresses the task of camera localization in a known 3D scene, given a single input RGB image. State-of-the-art approaches accomplish this with two steps. Firstly, regressing for every pixel in the image its so-called 3D scene coordinate and, subsequently, using those coordinates to estimate the final 6D camera pose via RANSAC. To solve the first step, Random Forests (RFs) are typical...

متن کامل

Full-Frame Scene Coordinate Regression for Image-Based Localization

Image-based localization, or camera relocalization, is a fundamental problem in computer vision and robotics, and it refers to estimating camera pose from an image. Recent state-ofthe-art approaches use learning based methods, such as Random Forests (RFs) and Convolutional Neural Networks (CNNs), to regress for each pixel in the image its corresponding position in the scene’s world coordinate f...

متن کامل

Are Random Forests Truly the Best Classifiers?

The JMLR study Do we need hundreds of classifiers to solve real world classification problems? benchmarks 179 classifiers in 17 families on 121 data sets from the UCI repository and claims that “the random forest is clearly the best family of classifier”. In this response, we show that the study’s results are biased by the lack of a held-out test set and the exclusion of trials with errors. Fur...

متن کامل

Searching Sexual Predators in Social Network Dialogues

In this paper we propose a two-step technique for detecting sexual predators from social network dialogues. One step for detecting dialogues in which a sexual predators participates, and the second step is for detecting, from the whole dialogue users, the one that is the sexual predator. From the three different supervised classifier employed, Random Forests obtained the best results in the fir...

متن کامل

Hand Gesture Recognition with Batch and Reinforcement Learning

In this paper, we present a system for real-time recognition of user-defined static hand gestures captured via a traditional web camera. We use SURF descriptors to get the bag-of-visual-words features of the user’s hand, and use these features to train a multi-class supervised learning model. We choose the best learning model from (SVM, Neural Networks, Decision Trees, and Random Forests) and t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017